Restructuring the QR Algorithm for Performance

نویسندگان

  • FIELD G. VAN ZEE
  • ROBERT A. VAN DE GEIJN
  • GREGORIO QUINTANA-ORT́I
چکیده

We show how the QR algorithm can be restructured so that it becomes rich in operations that can achieve near-peak performance on a modern processor. The key is a novel algorithm for applying multiple sets of Givens rotations. We demonstrate the merits of this new QR algorithm for computing the Hermitian (symmetric) eigenvalue decomposition and singular value decomposition of dense matrices when all eigenvectors/singular vectors are computed. The approach yields vastly improved performance relative to the traditional QR algorithm and is competitive with two commonly used alternatives—Cuppen’s Divide and Conquer algorithm and the Method of Multiple Relatively Robust Representations—while inheriting the more modest O(n) workspace requirements of the original QR algorithm. Since the computations performed by the restructured algorithm remain essentially identical to those performed by the original method, robust numerical properties are preserved.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Linear Algebra on High Performance Computers

This is a survey of some work recently done at Argonne National Laboratory in an attempt to discover ways to construct numerical software for high performance computers. The numerical algorithms discussed are taken from several areas of numerical linear algebra. We discuss certain architectural features of advanced computer architectures that will affect the design of algorithms. The technique ...

متن کامل

A Robust Discrete FuzzyP+FuzzyI+FuzzyD Load Frequency Controller for Multi-Source Power System in Restructuring Environment

In this paper a fuzzy logic (FL) based load frequency controller (LFC) called discrete FuzzyP+FuzzyI+FuzzyD (FP+FI+FD) is proposed to ensure the stability of a multi-source power system in restructured environment. The whale optimization algorithm (WOA) is used for optimum designing the proposed control strategy to reduce fuzzy system effort and achieve the best performance of LFC task. Further...

متن کامل

Symmetric QR Algorithm with Permutations

In this paper, we present the QR Algorithm with Permutations that shows an improved convergence rate compared to the classical QR algorithm. We determine a bound for performance based on best instantaneous convergence, and develop low complexity methods for computing the permutation matrices at every iteration. We use simulations to verify the improvement, and to compare the performance of prop...

متن کامل

Adding Aggressive Early Deflation to the Restructured Symmetric QR Algorithm

The QR algorithm is an algorithm for computing the spectral decomposition of a symmetric matrix [9]. Despite it’s high accuracy, other methods are often preferred for the symmetric eigenvalue problem due to the QR algorithm’s relatively poor performance [13]. In recent years, new techniques have arisen that dramatically improve its performance. The restructured symmetric QR algorithm, introduce...

متن کامل

MATHEMATICAL ENGINEERING TECHNICAL REPORTS CholeskyQR2: A Simple and Communication-Avoiding Algorithm for Computing a Tall-Skinny QR Factorization on a Large-Scale Parallel System

Designing communication-avoiding algorithms is crucial for high performance computing on a largescale parallel system. The TSQR algorithm is a communication-avoiding algorithm for computing a tall-skinny QR factorization, and TSQR is known to be much faster and as stable as the classical Householder QR algorithm. The Cholesky QR algorithm is another very simple and fast communication-avoiding a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011